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ABSTRACT Individual kernel weight is an important trait for maize yield determination. We have identified KEYWORDS 

genomic regions controlling this trait by using the B73xMo17 population; hov\/ever, the effect of genetic kernel weight 

background on control of this complex trait and its physiological components is not yet known. The objective of kernel growth 
this study was to understand how genetic background affected our previous results. Two nested stable rate 

recombinant inbred line populations (N209xMo17 and R18xMo17) were designed for this purpose. A total of grain-filling 
408 recombinant inbred lines were genotyped and phenotyped at two environments for kernel weight and five duration 

other traits related to kernel growth and development. All traits showed very high and significant (P < 0.001) genetic 
phenotypic variability and medium-to-high heritability (0.60-0.90). When N209xMo17 and R18xMo17 were background 
analyzed separately, a total of 23 environmentally stable quantitative trait loci (QTL) and five epistatic inter- effects 

actions were detected for N209xMo17. For R18xMo17, 59 environmentally stable QTL and 17 epistatic inter- complex traits 

actions were detected. Ajoint analysis detected 14 stable QTL regardless of the genetic background. Between Multiparent 
57 and 83% of detected QTL were population specific, denoting medium-to-high genetic background effects. Advanced 
This percentage was dependent on the trait. A meta-analysis including our previous B73xMo17 results iden- Generation 
tified five relevant genomic regions deserving further characterization. In summary, our grain filling traits were Inter-Cross 
dominated by small additive QTL with several epistatic and few environmental interactions and medium-to-high (MAGIC) 

genetic background effects. This study demonstrates that the number of detected QTL and additive effects for multiparental 
different physiologically related grain filling traits need to be understood relative to the specific germplasm. populations 

MPP 



Improving yield is the main objective of breeding programs in grain 
crops. For the majority of the twentieth century, maize grain yield was 
improved by applying selection techniques over phenotypic measure- 
ments (HaUauer and Miranda 1988). Since 1980, detection of quantita- 
tive trait loci (QTL) combining phenotypic information with molecular 
marker data has received considerable attention (Bernardo 2008). 
A large number of studies identifying molecular markers linked to 
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QTL involved in the inheritance of agronomically relevant traits such 
as grain yield have been described (Bernardo 2008). 

Grain yield is the final outcome of many processes occurring 
throughout the growing season. Qne avenue to investigating the genetic 
basis of yield is to dissect it into individual yield components and then 
search for their specific genetic basis (Slafer 2003). Maize grain yield 
can be described as a function of the number of harvested kernels and 
their individual weight. Qf these two components, kernel number is 
usually the one that explains most variation; however, both compo- 
nents affect final yield (Borras and Gambin 2010). Kernel weight (KW) 
is a highly heritable trait (Sadras 2007; Alvarez Prado et al. 2013a), 
varying markedly among genotypes (Reddy and Daynard 1983). 

The determination of KW is generally described by traits related to 
dry matter and water content accumulation (Schnyder and Baum 
1992; Borras et al. 2003; Bingham et al 2007; Rondanini et al. 2007) 
and is commonly divided into three phases: the lag phase, the effective 
grain-filling period, and the maturation drying phase (Figure 1; Bewley 
and Black 1985). The lag phase is a period of active cell division 
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characterized by water content increases with almost no dry matter 
accumulation. The effective grain-filUng period is characterized by rapid 
dry matter accumulation at a constant rate resulting from the deposition 
of reserves. Most genotypic differences in KW are related to changes in 
the kernel growth rate (KGR) around this period. KGR is very depen- 
dent on the sink capacity established early in grain fiUing and can be 
estimated with the kernel maximum water content (MWC; Figure ID). 
Moisture concentration (MC) within kernels is reduced throughout 
grain fiUing (Figure IC). At a particular critical MC biomass deposition 
stops and total grain-fiUing duration (GFD) is established. This moment 
is known as physiological maturity (Shaw and Loomis 1950). As such, 
GFD depends on the rate of kernel desiccation (KDR) and the MC that 
each specific genotype attains physiological maturity (MCPM; Figure 
ID). All these traits vary among exotic and elite germplasm (Borras 
et al. 2009), and we are interested in studying their genetic basis. 

Several studies on QTL mapping for maize KW have been con- 
ducted, and inconsistent results in terms of locahzation and effect size 
were obtained (Schon et al. 1994; Austin and Lee 1996, 1998; Frova 
et al. 1999). The lack of consistency could be related to the complexity 
of the trait, needing further dissection into simpler components. KW 
is commonly dissected in its physiological components KGR and 
GFD. These traits are governed by different physiological mechanisms 
(Borras and Gambin 2010). They also are genetically independent 
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Figure 1 Schematic figure describing phenotypic grain-filling traits of 
interest: (A) kernel weight (KW), kernel growth rate (KGR), and grain- 
filling duration (GFD); (B) maximum water content (MWC); (C) moisture 
concentration at physiological maturity (MCPM) and kernel desiccation 
rate during the effective grain-filling period (KDR); and (D) conceptual 
representation of trait hierarchy and correlated physiological mecha- 
nisms. See the section Materials and Methods for details regarding the 
measurement of each specific trait. Figure was adapted from Alvarez 
Prado et al., (2013b). 



traits as genomic regions associated with their determination do not 
colocaUze (Alvarez Prado et al 2013b). Depending on the specific germ- 
plasm used at each study, KW variability could be related to differences 
in GFD or KGR only (Figure ID). These differential mechanisms 
behind genetic differences in KW can generate inconsistent QTL 
localizations. 

Most previous studies dealing with QTL and KW determination 
have been conducted using different individual biparental populations. 
At these populations, only two alleles at any given locus are simulta- 
neously tested, without representing the genetic variability of the species 
(Holland 2007). Linkage mapping based on biparental populations can 
only identify QTL from the phenotypic diversity generated from the 
controlled cross. Use of multiple-cross mating designs sharing the same 
(Yu et al 2008; Li et al. 2011) or different parents (Kraakman et al 
2004; Blanc et al 2006; Verhoeven et al. 2006) enable higher power and 
resolution through joint linkage and association analyses. Statistical 
methods are currenfly available to correctly analyze connected popula- 
tions (Jannink 2007; Malosetti et al 2007; Li et al 2011). 

The objective of our study was to unravel the genetic architecture 
of maize KW determination by using a multiple-parental population 
generated from contrasting parental lines for KW and its physiological 
components. We were specifically interested in using a connected 
population because our final trait of interest (ICW) is determined through 
different physiological mechanisms. Testing a larger range of variabil- 
ity than commonly explored in biparental populations seemed critical 
for our needs. 

MATERIALS AND METHODS 

Plant material and experimental design 

Lines Mol7 (PI 558532), N209 (PI 595366), and R18 (PI 340856) were 
selected as parental lines for recombinant inbred line (RIL) population 
development. Selection of parental lines was based on their variability 
for grain-filling patterns (Borras et al 2009). We used Mol7 as the com- 
mon parent because we also had available the RIL population B73xMol7 
(IBM Syn4). 

Mo 17 is a yellow endosperm maize inbred with a red cob developed 
from the cross CI187-2xC103 (Zuber 1973). This inbred has a KW of 
c.a. 280 mg kernel" \ determined by a high KGR and a moderate GFD 
(Borras et al 2009). N209 is a yellow dent maize inbred developed 
directly from the NSSI (6), the sixth cycle of per se selection in the 
Nebraska Stiff Stalk Synthetic (Kaeppler et al. 1997). This inbred shows 
an average KW (c.a. 260 mg kernel"') determined by a long GFD and 
a moderate KGR. Inbred R18 was derived from the Supergold popu- 
lation (www.ars-grin.gov) with red pericarp. This inbred has a very 
small KW (c.a. 100 mg kernel"') due to a reduced KGR and GFD. 

Fi crosses R18xMol7 and N209xMol7 were made at the Agronomy 
Department of Iowa State University, Ames, Iowa, during 2006/07 
(Table 1) with parental seeds obtained from the USDA (www.ars- 
grin.gov). A single Fi plant was used to generate the F2 at Brunner 
Farm, Iowa State University, during 2007 (Table 1). These F2 were 
advanced to the Fg generation by single- seed descent and multiplied 
after. Generations were done in the United States and Argentina 
(Table 1). For this study, we used 204 RILs from each population 
(R18xMol7 and N209xMol7). Depending upon availability 10-15K 
per RIL is available for free by contacting Dr. L. Borras (lborras@unr. 
edu.ar). Shipping costs must be covered by the requester. 

Field experiments were conducted during 2010/2011 (hereafter 
2010) and 2011/2012 (hereafter 2011) growing seasons at the Campo 
Experimental ViUarino, Facultad de Ciencias Agrarias, Universidad 
Nacional de Rosario, at ZavaUa, Argentina. Each experiment was 
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■ Table 1 General description of the N209xMo17 and R18xMo17 
mapping population development 



Growth 



Generation 


Season 


Conditions 


Location 


Fi 


2006/07 


Greenhouse 


Ames, Iowa 
(USA) 


F2 


2007 


Field 


Ames, Iowa 
(USA) 


F3 


2007/08 


Field 


Buenos Aires 
(ARG) 


F4 


2008 


Field 


Slater, Iowa 
(USA) 


F5 


2008/09 


Field 


Santa Isabel, 
Santa Fe (ARG) 


F6 


2009/10 


Field 


Venado Tuerto, 



Santa Fe (ARG) 

ARG, Argentina. 

arranged in a randomized complete block design with three replicates. 
Planting was October 4, 2010, and September 22, 2011. A stand 
density of 7 pi m~^ was used for both experiments. Plots were over- 
planted and thinned at V2 and were one row 0.52 m apart and 5.5 m 
long. 

Experiments were conducted without water limitations using 
a sprinkler irrigation system. Pests and diseases were controlled by 
spraying commercially recommended fungicides and insecticides, and 
weeds were periodically removed by hand. 

Phenotypic measurements and analysis 

Kernel dry matter and water content were measured throughout kernel 
development beginning 15 d after each plot reached an thesis and until 
harvest maturity (15% kernel MC) following procedures described in 
Borras et al. (2003). To summarize, one plant per plot was sampled 
every 4 or 5 d between 07:00 and 10:00 am. The entire ear with sur- 
rounding husks was immediately enclosed in an airtight plastic bag 
and transported to the lab. Kernels were removed from the ear at floret 
positions 10—15 from the bottom of the rachis within a humidified 
box. Ten kernels per ear were sampled on each date. Fresh weight was 
measured immediately after sampling, and kernel dry weight was de- 
termined after drying samples at 70° for at least 96 hr. Fresh and dry 
weight were used to calculate kernel water content and kernel MC. 

KGR and GFD were determined for each genotype x replicate 
combination by fitting a bilinear model (Equation [1] and [2]) as in 
Borras et al. (2009) (Figure lA): 

KW = fl + b TT for TT < c (1) 

KW = fl + foe for TT > c (2) 

where TT is the number of heat units after pollination (°Cday), a is 
the jz-intercept (mg kernel" 0, fo is the KGR during the effective grain- 
filling period (mg °Cday"^), and c is the GFD (°Cday) (Figure lA). 
Heat units were calculated using 0° as base temperature (Muchow 
1990; Borras et al. 2009). Mean daily air temperature was registered at 
a weather station located approximately 150 m from the experimental 
plots. 

Kernel MWC was determined for each genotype x replicate 
combination by fitting a curvilinear model (Equation [3]) as in Borras 
et al. (2009) (Figure IB): 

WC = d + eTT+/TT^-^+^TT2 (3) 



where WC is kernel water content and d, e, /, and g are model 
parameters. 

Kernel MCPM was determined using a bilinear model relating 
kernel dry weight and kernel MC data (Equation [4] and [5]) following 
Borras et al. (2009): 

YM = h- iMC for MC >; (4) 

KW = h-ij for MC <j (5) 

where MC is moisture concentration (%), h is the jz-intercept 
(mg kernel"^), / is the rate of kernel MC decline during grain filling 
(mg kernel"' [%]"'), and; is the MCPM (%). This model was fitted 
for each genotype x replicate combination. 

KDR was determined using a linear regression model fitted for 
each genotype x replicate combination relating kernel MC and thermal 
time from pollination to physiological maturity: 

MC = + ; TT (6) 

where MC is kernel MC (g kg"'), k is the /-intercept (g kg"'), and 
/ is the KDR (g kg"' °Cday"'). 

AU curves were fitted using the iterative optimization technique of 
GraphPad Prism V5.0 (Raduschev 2007). 

Phenotypic variance was partitioned into genetic and environmen- 
tal components by using a mixed model for each population and trait 
separately. The model included environments (years), blocks nested 
within environments, genotypes and genotype x environment inter- 
action. AU model factors were considered as random. Proc MIXED 
using REML method from SAS statistical package was used for the 
analysis (SAS Institute 1999). 

Broad sense heritabUity of each trait was calculated on a mean 
basis as: 

= ^1/ H + Ke/-^) (7) 

where ct^g is the genotypic variance, ct^ge is the genotype x envi- 
ronment variance, a^j, is the plot residual variance, and t| and r are 
the number of environments and replication plots, respectively (Holland 
et al. 2003). 

For QTL analysis, phenotypic data were analyzed by using a dif- 
ferent mixed model approach following Malosetti et al. (2008). The 
midtitrait midtienvironment model was performed on each population 
separately where the data set consisted of I genotypes evaluated in 
/ environments with measurements on K traits (in our study, I = 204 
for each population, / = 2, and K= 6). Define an Af x 1 vector "y," with 
N = IJK that contains all the observations sorted by trait within envi- 
ronment for each population. Given that the interest is in the genetic 
variation within the population rather than the genotypes themselves, 
we assumed genotypes to be random. Trait-environment (TE) combi- 
nations and blocks nested within TE combination were considered as 
fixed. The mixed linear model for multitrait midtienvironment data 
were as follows: 

y = Xfi + Zu + E (8) 

where y is the vector of phenotypic observations, /3 is a vector of 
fixed effects due to the TE combination and blocks within TE, while 
u and g are the vectors of random effects due genotypes and residuals, 
respectively. X and Z are incidence matrices of Is and Os associated 
to fixed and random effects, respectively. Vector /3 contains the trait 
means within environments across genotypes and the blocks means 
within TE combination. Vector u denotes the random genotypic effects 
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per TE combination. Random genetic effects were assumed to be 
normally distributed « ~ N (0, G), being the G matrix a block diagonal 
with blocks of 12 x 12 variance-covariance parameters. Finally, e is 
a vector of non-genetic residuals associated with each observation and 
normally distributed e~ N (0, R) being R the residual variance (-fa^yk). 
The phenotypic (co)variance was: 

V(y) = ZGZ' + R (9) 

Different variance-covariance structures for the G matrix were 
assimied to select the most suitable and parsimonious for our data 
set. AU assumed structure models considered the factorial combina- 
tion of traits and environments, interpreting each TE combination as 
a trait by itself Evaluated structures used different parameter number 
for fitting data. Thus, the choice of the best model for our data was 
based on a goodness of fit criterion such as the Bayesian Information 
Criterion (Schwarz 1978). Compound symmetry model was selected 
as the best model for both populations. Considering this variance- 
covariance structure, best linear unbiased predictor of each genotype 
in each environment was estimated for reducing uncontrolled trait 
variation for QTL mapping (Alvarez Prado et al. 2013b). 

Genetic map construction and QTL analysis 

The 408 stable recombinant inbred lines (RILs) and the parental lines 
of each population were genetically characterized with 500 pubhc single- 
nucleotide polymorphisms. A total of 127 and 130 single-nucleotide 
polymorphisms (Supporting Information, File SI) were polymorphic 
and showed no segregation distortion for populations R18xMol7 
and N209xMol7, respectively. Linkage analysis was carried out using 
JoinMap v4 (Van Ooijen 2006). Individual genetic maps for each pop- 
ulation and a consensus genetic map were constructed. Map distances 
were computed with the Haldane mapping fianction (Haldane 1919). 

Statistical models for QTL analysis at individual populations: For 

individual populations, a QTL analysis was carried out using the in- 
clusive composite interval mapping (ICIM) proposed by Li et al. (2007) 
using MET option from QTL IciMapping software (available from www. 
isbreeding.net). As composite interval mapping, ICIM method uses a 
linear regression model: 

m+1 

yi = bo + Y^ bjXij + et, (10) 

;=i 

where yi is the trait value of the ith individual in the mapping pop- 
ulation; bo is the overall mean of the model; Xy is a dummy variable 
for the genotype of the !th individual at the jth marker; bj is the re- 
gression coefficient of the phenotype on the jth marker conditional on 
all other markers; and e,- is the residual error that is assumed normally 
distributed. 

When building the model (10), ICIM uses all marker information 
by selecting the most important markers and setting to zero the 
unselected ones through a stepwise regression. In order to avoid over 
fitting a probability of 0.0001 was used (Li et al. 2008). Thus, for 
a testing position in a specific interval {k,k+ 1) the phenotypic values 
from model (10) are adjusted by: 

j^k. k+l 

where bj is the estimate of bj in model (10). Phenotypic values were 
adjusted by all markers retained in the regression equation except 



the two markers flanking the current interval. The adjusted pheno- 
type (zly,) contains the position and additive effect information of 
the QTL in the current testing position and excludes the influence of 
the QTL located on other interval or other chromosomes (Wang 
et al. 2012). A logarithm of odds (LOD) score of 2.5 and a step size 
of 2 cM were used for QTL analysis. 

Digenic epistasis of QTL was estimated using the linear regression 
model developed by Li et al. (2008) in QTL IciMapping software: 

m+1 

yi = bo+ ^ bjXij + ^ bjkXijXik + ei, (12) 

j=ij<k 

where y,- is the phenotypic value of the trait for the ith individual in 
the mapping population; bg is the overall mean; x^ is a dummy 
variable for the genotype of the ith individual at the jth marker; 
bj is the partial regression coefficient of the phenotype on the jth 
marker variable; fo^j. is the partial regression coefficient of the phe- 
notype on the multiplication variable of the jth and kth markers; and 
6; is the residual error that is assumed normally distributed. 

A two-stage stepwise regression strategy was adopted for param- 
eter estimation. First, significant markers were selected. Then stepwise 
regression with a probability of 0.0001 was applied to the residuals 
from the first stage to select significant markers pairs and estimate 
thefr effects in the model (Li et al. 2008): 

^yi=yi-'Yl ~ ^rsXirXis + 6; , (13) 

r^j.j+l,k,k+i r^jj+t 
s^k.k+\ 

where br and fo„ are the estimates of b,. and in model (12), re- 
spectively. The adjusted phenotype Ayi thus obtained contains the 
information of QTL in the two testing intervals including two posi- 
tions, two additive effects of individual QTL, and one epistatic effect 
between the two QTL. A LOD score of 5 and a step size of 10 cM for 
improving detection accuracy were used for epistasis detection (Zhang 
et al. 2012). 

Statistical models for joint QTL mapping: For statistical analysis of 
combined populations we used joint ICIM, developed by Li et al. 
(2011). The joint ICIM method is similar to model (11) but includes 
a population effect term: 

^y/f = yif - "if^f '^j^'fj' (^^) 

jj^k, k+l 

where Uf is the background effect of each founder with the common 
parent for the jth population and a,^ is a dummy variable for the 
genotype of the ith individual at the^th population. Because equa- 
tions (11) and (14) differed only in the background effect (uf), this 
was estimated as the proportion of QTL number difference between 
QTL analysis of individual populations and that of joint populations. 

Meta QTL analysis: To study QTL congruency a meta QTL analysis 
including results from the two evaluated populations and the IBM 
(B73xMol7) Syn-4 population (Alvarez Prado et al. 2013b) was per- 
formed using Biomercator v3 (Sosnowski et al. 2012). We conducted 
a meta QTL analysis instead of a joint QTL analysis because evaluated 
populations shared only one of the two tested envfronments and 
because B73xMol7 shared very few molecular markers with the other 
two populations. Meta QTL analysis uses a two-stage procedure to in- 
tegrate multiple independent QTL experiments. The first step consisted 
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Figure 2 Multienvironmental quantitative trait loci (QTL) mapping results 
for kernel weight, kernel growth rate, grain-filling duration, maximum water 
content, kernel desiccation rate, and moisture concentration at physiolog- 
ical maturity for R18xMo17 and N209xMo17 populations, respectively. 
Black lines represent logarithm of odds (LOD) profile for additive QTL. Red 
lines represent LOD profile for additive QTL x environment. Dotted lines 
represent LOD threshold of 2.5. 



of integrating individual genetic maps into a consensus map. For con- 
necting maps, the procedure needed at least two common markers per 
chromosome between maps. In those cases in which there were fewer 
than two common markers per chromosome between two maps, marker 
physical positions (http://www.maizegdb.org) were compared and 
those markers showing to be in tight linkage (overlapped positions) 
were considered as synonyms for the purpose of the meta QTL anal- 
ysis. Genetic map from N209xMol7 was used as reference map be- 
cause it had more markers in common with remaining populations. 

The second step consisted on projecting QTL from individual 
studies on the consensus map. By assuming there are a finite number 
of true QTL locations, the meta QTL algorithm performed a clustering 
approach based on a Gaussian-mixture distribution to both classify the 
observed QTL and estimate its position (Veyrieras et al. 2007). We 
later continued testing all possible QTL combinations and then choos- 
ing the one that maximizes a penalized log-likehhood. To select the 
number of meta QTL, the software provides five different criteria for 



model choice (Veyrieras et al. 2007). In this study, Akaike Information 
Criterion (Akaike 1974) was used to select final number of meta QTL. 

RESULTS 

Phenotypic results 

Phenotypes from both RIL populations showed narrow differences in 
time to silking. The accimiulated thermal time from planting to the first 
flowering line and to the last flowering line were 951 to 1269 °Cday 
(representing 19 d) and 985 to 1286 °Cday (18 d) for R18xMol7 and 
N209xMol7, respectively, in 2010. The accimiulated thermal times 
were from 1107 to 1428 °Cday (20 d) and 1107 to 1444 °Cday (21 d) 
for R18xMol7 and N209xMol7, respectively, in 2011. Most impor- 
tant, phenotypic variation in time to silking was not associated with 
any grain-filling trait (P > 0.05). 

The coefficient of variation for all phenotyped traits ranged from 
7 to 13% (Table 2), allowing a precise estimation of variance compo- 
nents and heritability. Designed populations explored a wide range of 
phenotypic values for aU grain-filling traits (Table 2). Approximately 
73 and 49% of KW variation in R18xMol7 and N209xMol7, respec- 
tively, was genotypic variation (Table 2). 

Phenotypic variance of KGR was mostiy influenced by genotype 
and genotype x environment components (Table 2) showing mediimi 
(0.61) to high (0.80) heritability in N209xMol7 and R18xMol7 pop- 
ulations, respectively (Table 2). As expected, KW was positively cor- 
related with KGR in both R18xMol7 (r = 0.81, P < 0.001) and 
N209xMol7 (r = 0.55, P < 0.001) populations; genotypes with greater 
KGR showed greater KW. 

Genotypic variance of GFD was significant at both populations 
(P < 0.05) but in population N209xMol7 was (13%) lower than the 
genotype x environment interaction (26%) and residual variances (51%), 
leading to a low heritability estimate (0.37; Table 2). In population 
R18xMol7, genotypic variance (21%) was also lower than the residual 
variance (55%; Table 2). KW was significantiy (P < 0.001) and posi- 
tively correlated with GFD for both R18xMol7 (r = 0.60) and 
N209xMol7 (r = 0.69), genotypes with longer GFD showed greater KW. 

KGR is physiologically related to kernel MWC attained at mid- 
grain filling (Figure 1). For both populations, genotypic variance of 
MWC was the greatest magnitude component of the phenotypic var- 
iance (79 and 64% for R18xMol7 and N209xMol7, respectively), 
leading to very high heritability estimates (Table 2). KGR was signif- 
icantly (P < 0.0001) and positively correlated with variations in MWC 
for both R18xMol7 (r = 0.83) andN209xMol7 (r = 0.55) populations. 

The duration of the grain filling period depends on how quickly 
each genotype attains a specific MCPM maturity through the KDR, 
Estimated residual variance was approximately 60% for both pop- 
ulations, which was greater than genotypic variance (29 and 26% for 
R18xMol7 and N209xMol7, respectively; Table 2). GFD was nega- 
tively correlated with variations in KDR for N209xMol7 (r = —0.46, 
P < 0.001) and R18xMol7 (r = -0.25, P < 0.001), respectively. 

Similar to KDR, residual variance for MC at physiological maturity 
was the greatest variance component for this trait, followed by the 
genotypic, genotype x environment, and environmental variances. 
Heritability estimates were similar to those for KDR (Table 2). GFD 
was negatively correlated with MCPM variations in the N209xMol7 
(r = -0.41, P < 0.001) and R18xMol7 (r = -0.31, P < 0.001) 
populations. Genotypes with shorter GFD reached physiological ma- 
turity with greater MC. 

In summary, all evaluated traits in the multiparental RIL populations 
showed significant differences, creating an ample phenotypic variability 
for QTL detection. 



1616 I S. Alvarez Prado et a/. 



:-5iG3'Genes | Genomes | Genetics 



■ Table 3 QTL for kernel weight, kernel growth rate, grain-filling duration, maximum water content, and moisture concentration at PM 
identified in the connected RIL populations using joint inclusive composite interval mapping 











Additive Effects 2010 


Additive Effects 201 1 


Trait 


Chrom. 


Left Marker 


Right Marker 


N209xMo17 


R18xMo17 


N209xMo17 


R18xMo17 


Kernel weight 


1.01 


MAIZE.5255.C5 


DT272SNP179 


13.6 


4.7 


— 


— 




1.05 


BG549215 


ZM004621 


10.0 


10.9 


7.0 


15.9 




1.11 


U10418 


TC3 10451 


5.8 


9.3 


- 


- 




3.06 


TAQKWS4441 


AY1 04231 


— 


— 


7.7 


8.3 




5.05 


CT08SNP271 


ZM011159_18 


-17.5 


4.9 


-14.4 


6.1 




9.05 


TC306801 


CL400916 


— 


- 


4.5 


11.0 




10.03 


MAIZE.4293.C1 


AJ400868 


14.1 


5.2 


13.7 


5.9 


Kernel growth rate 


1.03 


OS011512 


AY1 07709 


0.016 


-0.081 


— 


— 




1.11 


U10418 


TC3 10451 


0.003 


0.023 


0.008 


0.017 




5.02 


X73980 


HHUVIB12 


— 


- 


-0.005 


0.017 




9.05 


TC306801 


CL400916 




— 


0.007 


0.021 




10.03 


MAIZE.4293.C1 


AJ400868 


0.019 


0.012 


— 


— 


Grain-filling duration 


1.05 


BG549215 


ZM004621 


18.0 


17.1 


24.6 


20.1 




7.02 


PM01 56-620 


AY106713 


-34.5 


9.2 


-27.9 


7.0 




8.01 


AY1 05770 


MAIZE.264.C1 


-24.0 


14.4 


— 


— 


Maximum water content 


3.04 


ZM011693 


DTI 34 


-11.6 


2.7 


-12.7 


3.0 




4.07 


AY1 04901 


BZ534879.1 


12.9 


6.4 


13.8 


5.2 




5.02 


X73980 


HHUVIB12 


— 


- 


3.9 


10.9 




5.05 


CT08SNP271 


ZM011159_18 


-9.5 


6.5 


-6.8 


8.4 




9.05 


TC306801 


CL400916 


7.6 


8.0 


10.2 


12.1 


Kernel desiccation rate 


1.04 


AY1 06901 


ZM010356 


-0.023 


-0.011 


— 


— 




1.06 


AY1 03580 


AY105791_1 


-0.022 


-0.001 


-0.019 


-0.004 




2.01-2.02 


SM0017D 


AZM4_1 02791 


0.011 


0.031 


0.011 


0.022 




5.05 


CT08SNP271 


ZM01 1 1 59_1 8 


0.019 


0.015 








7.03-7.04 


AY106713 


DTI aSSNP437 


0.019 


0.005 


0.020 


0.011 


Moisture concentration at PM 


1.06 


AY105791_1 


AY1 03863 


-0.4 


1.4 








2.08 


DR791509 


AZM4_1 25066 


1.5 


0.8 








4.08 


BZ534879.1 


BH417806 


1.1 


1.1 


1.2 


1.0 




7.02 


MAGI_19986_1 


ZM004923 


2.0 


-0.9 


1.8 


0.4 




9.05 


TC306801 


AY105818 


0.2 


1.4 


0.8 


1.8 


QTL, quantitative trait loci; PM, physiological maturity, RIL, recombinant 


inbred line. 











QTL mapping for individual RIL populations 

Individual populations were evaluated under two environments to test 
for environmentally stable QTL. A total of 23 and 59 environmentally 
stable QTL were detected along the 10 chromosomes in N209xMol7 
and R18xMol7, respectively (Figure 2). The majority of detected QTL 
were of small effects, except for one QTL for MWC, which showed a 
relatively major effect (12%) at the N209xMol7 population (Table SI). 
In population R18xMol7, some relatively major QTL were detected 
for KGR, KW, MWC and MCPM explaining from 11 to 14% of the 
phenotypic variance (Table SI). 

Several colocalizations were detected among environmentally stable 
QTL for physiologically related traits. Two QTL for KW in N209xMol7 
population colocaUzed with a QTL for MWC on chromosome 3 and 
for KGR on chromosome 10 (Figure 2). Both detected QTL for GFD 
colocalized with QTL for MCPM. In population R18xMol7, all nine 
detected QTL for KW colocalized with QTL of other grain filling related 
traits. Seven QTL for KW colocalized with QTL for KGR and MWC. 
Three and two QTL for KW colocalized with QTL for MCPM and GFD 
on chromosomes 1, 7, 9, and 10 (Figure 2). 

A total of 5 and 17 pairs of markers with significant interaction 
effects were detected for N209xMol7 and R18xMol7, respectively 
(Table S2). For N209xMol7 population, all traits except KDR showed 
one significant epistatic interaction. Among all traits, only one of the 
five interactions involved loci which both had significant main effects. 
The remaining four interactions involved loci that were individually 
nonsignificant (Table S2). Individual epistatic interactions explained 



up to 5% of the phenotypic variation of KW, 13% for KGR, 10% for 
GFD, 15% for MWC, and 5% for MCPM (Table S2). For R18xMol7, 
five pairs of markers showed significant epistatic interactions for 
KW, two pairs for KGR and KDR, three pairs for MWC, three pairs 
for GFD, and one pair for MCPM. Among all traits, only two of the 
17 interactions involved loci that both had significant main effects. 
However, seven of the other interactions involved one locus that 
had a significant effect on the trait. The remaining eight inter- 
actions involved loci which were individually nonsignificant. In- 
dividual epistatic interactions explained from 3 to 14% of the 
phenotypic variation of KW, 10% for KGR, from 7 to 11% for 
GFD, from 5 to 12% for MWC, from 8 to 10% for KDR, and 7% 
for MCPM (Table S2). 

Joint-population QTL mapping 

After testing environmentally stable QTL for individual populations, 
we tested QTL stability across backgrounds by analyzing both con- 
nected populations at two growth environments. A total of 35 QTL 
were significant for joint multiparental population (Table 3). From 
total QTL, 16 regions associated with KW (3 QTL), KGR (1 QTL), 
GFD (2 QTL), MWC (4 QTL), KDR (3 QTL), and MCPM (3 QTL) also 
were stable across environments showing relatively consistent effects 
(Table 3). 

A total of 13 QTL were associated with KW (Table 3) explaining 
from 12 to 57% of the phenotypic variation across both populations. 
For both ICW components (KGR and GFD) seven and four QTL 
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Figure 3 Proportion of detected quantitative trait loci (QTL) in 
N209xMo1 7, R1 SxMol 7 and in joint populations for each trait at each 
environment. Differences between individual and joint population repre- 
sent the background effects. 



explaining between 15-24% and 13-46% for KGR and GFD were 
detected, respectively. Four QTL for KGR and one QTL for GFD colo- 
calized with KW QTL (Table 3). As expected, no QTL colocalization for 
GFD and KGR was detected (Table 3). 

For kernel water related traits 20 QTL were detected along the 
genome (Table 3) explaining 15-32%, 12-27%, and 12-32% of the 
phenotypic variation for MWC, KDR, and MCPM, respectively. For 
MWC, two QTL colocalized with its physiologically related trait KGR, 
and three QTL colocalized direcdy with QTL for KW (Table 3). No 
QTL for MCPM and KDR colocalized with a QTL for GFD (Table 3). 

When comparing QTL analysis for individual and joint popula- 
tions, we estimated background effects for each trait at each environ- 
ment (Figure 3). Between 57 and 83% of detected QTL were population 
specific, denoting medium-to-high genetic background effects. GFD 
and MWC were the traits with the lowest genetic background influence, 
as 43-33% and 36-38% of detected QTL were common for both pop- 
ulations, respectively (Figure 3). BCW and KGR showed a greater back- 
ground effect than GFD, showing 72 to 74% and 75% of the detected 
QTL to be popidation specific mostly associated with R18xMol7 pop- 
ulation (Figure 3). KDR and MCPM showed the greatest genetic back- 
ground effects with 80-83%, and 69-79% of the detected QTL being 
population specific for KDR and MCPM, respectively (Figure 3). As 



such, the different studied grain filling traits showed QTL results to be 
more population specific than others. 

Meta QTL analysis 

A meta QTL analysis was conducted in order to confirm QTL positions 
of the evaluated populations. For this we increased background vari- 
ability using results of environmentally stable QTL for all KW de- 
termination traits from the IBMSyn4 (B73xMol7) population from 
Alvarez Prado et al. (2013b). A total of 41 distinct QTL clusters (meta 
QTL) were identified when considering KW and its physiologically 
related traits (Figure 4). The nimiber of meta QTL ranged from none 
in chromosome 8 to seven in chromosome 1. Five meta QTL were 
considered relevant for further studying for our physiologically corre- 
lated traits. Those were located on chromosomes 5, 6, 9 (two meta 
QTL) and 10 (Figure 4). AH relevant meta QTL contained QTL asso- 
ciated with KW, KGR, and MWC, and two of them also showed QTL 
for MCPM and KDR. Meta QTL from chromosomes 6 and 10 were 
the most robust, as they contained QTL originally detected at all three 
populations (Figure 4). 

DISCUSSION 

Our study on the genetic basis controlling KW and its physiological 
mechanisms provided insight in the genetic architecture of an econom- 
ically relevant trait for maize grain yield determination. The relevance of 
our study is linking crop physiology mechanisms with genetics using 
a multiparental RIL population. Previous reports on multiparental RIL 
populations focused on simpler traits recorded at a fixed stage (Buckler 
et al. 2009; Peiffer et al. 2013); however, many traits have a temporal 
trajectory, and their final value is the consequence of many independent 
QTL expressed during ontogeny (Maloof 2003). In our study, we dis- 
sected a complex trait (KW) in its physiological components where dif- 
ferent mechanisms can explain changes in our final trait of interest. We 
studied the genetic architecture of an emergent trait (KW) through the 
determination of rate and duration of kernel growth (Figure 1; Yin et al. 
2004). Specific phenotyping protocols allowed us to precisely characterize 
a large nimiber of genotypes from contrasting genetic backgrounds for 
physiological traits related to kernel growth and development. 

Complex trait dissection in many species has largely relied on two 
main approaches, linkage analysis with biparental populations and 
linkage disequilibrium analysis with association panels. More recently, 
a maize-nested association mapping population (NAM) was de- 
veloped combining the favorable properties of both approaches cap- 
turing a wide range of maize genetic diversity, exploiting ancestral 
recombination, and having stifficient power to dissect the genetic 
architecture of complex traits (Yu et al. 2008; Buckler et al. 2009; 
Peiffer et al. 2013). In our study we developed a multiparental pop- 
ulation following the NAM concept. It was specifically designed for 
exploring different KW determination strategies and a genome scan 
with high power for detecting QTL with effects of different sizes (due 
to differences in parental inbreds). Parental inbreds were previously 
selected from a grain-filling pattern screening (Borras et al. 2009) and 
later used for this multiparental population development. This valu- 
able genetic resource showed an ample phenotypic variability, with 
changes in KW achieved through different combinations of rate and 
duration of kernel growth. 

Results from QTL analysis on our midtiparental population con- 
firm previous results using the IBM Syn4 (B73xMol7) population 
(Alvarez Prado et al. 2013b) suggesting that, for maize, the genetic 
architecture of KW determination is dominated by small additive 
QTL with several epistatic and medium to low environmental inter- 
actions. As expected for complex quantitative traits this study showed 
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Figure 4 Consensus map including N209xMo17, R18xMo17, and B73xMo17 populations. Meta quantitative trait loci (QTL) are represented with 
different colors within the chromosomes. Outside each chromosome there are individual QTL from each population. The color of the letters 
represents the population in which the QTL was detected and bar color the meta QTL to which each individual QTL belongs. 



evidence of genetic background effects for aU grain-filling traits. Back- 
ground effects are commonly observed across literature in maize 
(Mihaljevic et al. 2004; Li et al. 2009) and other plant species (Pilet 
et al. 2001; Xie et al. 2008; Zuo et al. 2013). According to Holland 
(2007), inconsistency of QTL across mapping populations is the result 
of genetic heterogeneity. When many genes altering different physi- 
ological mechanisms are controlling a specific trait, different subsets 
can segregate at different popidations (Holland 2007). In the present 
article, the magnitude of the genetic background affected each grain 
filling trait differently. When comparing background effects across 
traits, we found that GFD and MWC were the most stable as the 
result of changes in the genetic background followed, in descending 
order, by KW, MCPM, KGR, and KDR. We hypothesized that back- 
ground effects for our grain-filling traits may ascribe to the physio- 
logical complexity of the trait. Thus, traits with a greater hierarchy 
level (or those that are more complex; like KW) would be more 
affected than traits with lower hierarchy or are less physiologically 
complex (all other traits; see Figure IC); however, our results do not 



support this hypothesis; KW was among the less affected trait for 
background effects. 

In addition to results from Alvarez Prado et al. (2013b) we de- 
tected epistatic interactions with minor effects at both individual pop- 
ulations. The finding that epistatic interactions were detected between 
loci with significant main effects and loci without significant main 
effects may have implications on the genetic architecture of complex 
traits (Holland et al. 1997). The effects of QTL that exhibit interac- 
tions with unlinked genes may be altered dramatically when they are 
incorporated into a genetic backgroimd different from the one in 
which they were mapped. Our results partially supported this assump- 
tion as epistatic interactions were detected among markers for all 
evaluated traits which showed mediimi to high background effects, 
but no association was observed between the number of interactions 
and the magnitude of background effect. The high nimiber of detected 
epistasis could be associated with the nimiber of genes involved during 
the grain-filling period. Liu et al. (2008) described 3445 differentially 
expressed genes associated with processes like cell division, cell growth 
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and differentiation, starch, storage protein, fatty acids, and phytohor- 
mone signaling pathways during grain filling. 

Consistent colocalization of QTL for MWC, KGR, and KW help 
explain common phenotypic correlations usually observed (Swank 
et al. 1987; Calderini et al. 2000; Borras and Westgate 2006). QTL 
colocalization of correlated traits was independent of the genetic back- 
ground, confirming the common genetic basis previously reported for 
these traits (Alvarez Prado et al. 2013b). These new results confirm 
our hypothesis that correlated traits are likely to be controlled by the 
same gene/s. 

Evaluated populations showed a high number of stable QTL across 
environments. This stability could be associated with the large genetic 
variation respect to genotype x environment interaction variation. 
Almost 20% of the stable QTL also were stable across genetic back- 
grounds but showed changes in effect size. When comparing stable 
QTL with previous results (Alvarez Prado et al. 2013b), we confirmed 
several regions associated with KW determination whereas others 
remained population specific because of background effects. These 
relevant regions also were detected in other studies dealing with maize 
KW (Ribaut et al. 1997; Austin and Lee 1998; Melchinger et al. 1998; 
Capelle et al. 2010; Peng et al. 2011; Guo et al. 2011). Although effect 
sizes may be different, it is important to highlight that several genomic 
regions were always significant for KW determination (Figure 4), and 
these may represent regions relevant for identifying genes responsible 
for natural variation. 

KW and its physiologically related traits are relevant for maize 
grain yield determination. The link between crop physiology and ge- 
netics highlighted in this study provided insight into the genetic ar- 
chitecture of KW and the mechanisms involved in its determination. 
These complex traits are dominated by small additive QTL, several 
epistatic interactions between significant and nonsignificant main ef- 
fect QTL, a few environmental interactions, and medium-to-high back- 
grounds effects. Genetic background effects were significant for all 
evaluated traits. 

Despite several chromosome regions that were stable, effect sizes 
were modified as the genetic background varied. When comparing 
results from this study with previous results, we found that several 
genomic regions were confirmed as stable across environments and 
genetic backgrounds. Future efforts targeting genes within those re- 
gions will help our understanding of maize KW determination. 
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